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Abstract 

It is believed that strong ferromagnetic orders in some solids are generated by subtle 
interplay between quantum many-body effects and spin-independent Coulomb inter- 
actions between electrons. Here we describe our rigorous and constructive approach 
to ferromagnetism in the Hubbard model, which is a standard idealized model for 
strongly interacting electrons in a solid. We introduce a class of Hubbard models in 
any dimensions which are nonsingular in the sense that both the Coulomb interaction 
and the density of states (at the Fermi level) are finite. We then prove that the ground 
states of the models exhibit saturated ferromagnetism, i.e., have maximum total spins. 
Combined with our earlier results, the present work provides nonsingular models of 
itinerant electrons with only spin-independent interactions where low energy behaviors 
are proved to be that of a "healthy" ferromagnetic insulator. 
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1 Introduction 



The origin of strong ferromagnetic order observed in some solids has long been a mystery in 
physical science. After Heisenberg [1], it became clear that the ultimate origin of ferromag- 
netism lies in a subtle interplay between quantum many-body effects and strong Coulomb 
interaction between electrons. To provide convincing derivations of ferromagnetism in con- 
crete models of many electrons, however, remained unsolved (even on a heuristic level) for a 
long time. 

The problem is difficult because neither quantum many-body effects, nor the Coulomb 
interaction alone favors ferromagnetism (or any magnetic ordering). One must deal with an 
interplay of both the factors. Moreover, the intrinsically nonperturbative nature of the phe- 
nomenon makes the problem almost impossible to attack within conventional perturbative 
methods in condensed matter physics. A generic many-electron system without interactions 
is known to have a paramagnetic ground state, a phenomenon known as Pauli paramag- 
netism. In order to destabilize Pauli paramagnetism and stabilize ferromagnetism, one must 
have a sufficiently large interaction. For example, a heuristic argument due to Stoner im- 
plies the criterion that U Dp ^ 1 is necessary to stabilize ferromagnetism, where U is the 
on-site Coulomb interaction and Dp is the density of states at the Fermi level 2 . This is the 
well-known "competition" between quantum dynamics and Coulomb interaction. 

In the present paper, we describe our constructive and mathematically rigorous approach 
to the origin of ferromagnetism. This is a continuation of the series of works [2, 3, 4, 5], and 
the main result of the present paper was described in [6] for a special one-dimensional model. 
Here we present a class of Hubbard models in any dimensions with a finite density of states (at 
the Fermi level) and finite interactions, and prove that their ground states are ferromagnetic. 
Combined with our earlier work [4, 5], this provides a class of nonsingular models of itinerant 
electrons (with only spin-independent interactions) in which low energy behaviors (i.e., the 
nature of the ground states and the low-lying excitations) are rigorously proved to be those 
expected in ferromagnetic insulators. We hope that the present work becomes a starting 
point of further investigations of deep interplay between quantum dynamics and nonlinear 
interactions in strongly interacting quantum many-body systems. 

The present paper is written in a nearly self-contained manner. In Section 2, we give 
the definition of the Hubbard model. In Section 3, we briefly review rigorous results about 
ferromagnetism in the Hubbard model, and motivate the present paper. In Section 4, we 
summarize, in a typical class of models, the main results of our constructive program in the 
present and the previous works of ours. The reader who is interested in the new physical 
results is invited to start from this section. In Section 5, which is the main section of the 
paper, we define our models in the most general setting and state our conclusions precisely. 

2 This is only a heuristic criterion, and there are many counterexamples. 
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Final section 6 is devoted to the proof of the main theorem. 

2 Definition of the Hubbard model 

The Hubbard model is a standard simple model of interacting itinerant electrons in a solid. 
Although this model is too idealized to be regarded as a quantitatively reliable model of 
real solids, it contains physically essential features of interacting itinerant electron systems. 
It is expected to exhibit various phenomena including antiferromagnetism, ferromagnetism, 
ferrimagnetism, superconductivity, and metal-insulator transition. Some (but not all) of 
these phenomena have been treated rigorously in some cases [7]. 

In the present section, we define the Hubbard model in the general setting, and fix our 
notation. We leave details and backgrounds to more careful reviews (such as [7, 8, 9]) and 
present only the minimum necessary ingredients. 

2.1 Basic operators 

Let a lattice A be a finite set whose elements r, s, • • • G A are called sites. A site represents 
an atomic orbit in a solid. 

For each rGA and a =T> l> we define the creation and the annihilation operators 
c\ a and c rjCT for an electron at site r with spin a. These operators satisfy the canonical 
anticommutation relations 

H,*, C s,t} = Sr, s S a , T , (2.1) 

and 

{< ff ,< T } = {c r , ff ,c 1 ,, T } = 0, (2.2) 

for any r, s G A and a, r =|, |, where {A, B} = AB + BA. The number operator is defined 
by 

n r,a = c r ,a C r,Ti (2-3) 

which has eigenvalues and 1. 

The Hilbert space of the model is constructed as follows. Let $ vac be a normalized vector 
state which satisfies c r;(T $ vac = for any rGA and o =|, \. Physically $ vac corresponds to 
a state where there are no electrons in the system. Then for arbitrary subsets A-f,A^ C A, 
we define a state 3 

(n<t) (n^Kac, (2.4) 

VfA / \ reA i / 

3 Throughout the present paper, we assume that the sites in the lattice are ordered (in an arbitrary but 
fixed manner), and products of fermion operators respect the ordering. 
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in which sites in Af are occupied by up-spin electrons and sites in Aj_ by down-spin electrons. 
The Hilbert space for the system with N e electrons is spanned by the basis states (2.4) with 
all subsets Af and Af such that 4 |Af| + |AjJ = N e . 

We finally define total spin operators S to t = (-Stot> ^toh ^tot) by 

S& = \ E 4> (a) Wc r , T , (2.5) 

r£A 
<r,T=1,i 

for a = 1, 2, and 3. Here are the Pauli matrices defined by 

The operators S to t are the generators of 577(2) rotations of the total spin angular momentum 
of the system. As usual we denote the eigenvalue of (S to t) 2 as S tot (S tot + 1). The maximum 
possible value of S tot is N e /2 when N e < |A|. 

2.2 General Hamiltonian 

The model is characterized by the hopping amplitudes t r ^ s = t StT G H. defined for all r, s G A, 
and the magnitude U > of the on-site Coulomb interaction. Physically, t r s represents the 
quantum mechanical amplitude for an electron to hop from the site s to site r when s ^ r, 
and the on-site potential when r — s. Usually t TyS is non-negligible only when the two sites 
r and s are close to each other. 

We then define the general Hubbard Hamiltonian as 

H = E *»-,« C l,a°s,a + U Y1 n r,\ U r,i- ( 2 - 7 ) 
r,sGA rgA 

Here the first term describes quantum mechanical motion of electrons which hop around the 
lattice according to the amplitude t ryS . 

The second term represents nonlinear interactions between electrons. There is an increase 
in energy by U > for each doubly occupied site, i.e., a site which is occupied by both up- 
spin electron and down-spin electron. This is a highly idealized treatment of the Coulomb 
interaction between electrons. 

The Hamiltonian which consists only of the first term in (2.7) describes the free tight- 
binding electron model. It is not very difficult to analyze this model especially when the 

4 Throughout the present paper |5| denotes the number of elements in a set S. 
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hopping amplitude t TjS has a translation invariance. The Hamiltonian can be diagonalized 
in the states in which electrons behave as "waves." 

The Hamiltonian which consists only of the second term in (2.7) is also easy to study. 
The Hamiltonian is already diagonalized in the basis states (2.4), in which electrons behave 
as "particles." 

When both the first and the second terms in (2.7) are present, a "competition" between 
wave-like nature and particle-like nature of electrons takes place. The competition generates 
rich nontrivial phenomena including ferromagnetism. To investigate these phenomena is a 
main motivation in the study of the Hubbard model. 

3 Rigorous results about ferromagnetism in the Hub- 
bard model 

In the present section, we formulate the problem of saturated ferromagnetism in the Hubbard 
model. We then give a brief review of the rigorous results about ferromagnetism in the 
Hubbard model, and explain background of the present work. For more careful reviews, see 
[8, 9]. 

3.1 Saturated ferromagnetism in the ground states 

It is easily shown that the Hamiltonian (2.7) commutes with the total spin operators . 
Therefore one can look for simultaneous eigenstates of H and (S to t) 2 - 

When all the ground states of the Hamiltonian H (with a fixed electron number N e < 
|A|) are eigenstate of (S to t) 2 with S tot = N e /2, we say that the model exhibits saturated 
ferromagnetism. This is the strongest form of ferromagnetism since N e /2 is the maximum 
possible value for St ot . 

3.2 Ferromagnetism of Nagaoka and Thouless 

The first rigorous and nontrivial result about saturated ferromagnetism in the Hubbard 
model is due to Nagaoka [10] and to Thouless [11]. It was proved that the Hubbard model on 
a class of lattices (which includes most of the standard lattices in two and three dimensions) 
with t TyS > exhibits saturated ferromagnetism when N e = |A| — 1 and U = oo. In other 
words the model is not allowed to have any doubly occupied sites, and there is only one site 
without an electron. 

The ferromagnetism of Nagaoka and Thouless is quite important since it showed for the 
first time that the Hubbard model can generate ferromagnetism through nontrivial interplay 
between quantum dynamics and Coulomb interaction. Subsequent studies, however, have 
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Figure 1: The so called copper oxide lattice. As a consequence of Lieb's theorem [12], 
it is proved that the Hubbard model with U > on this lattice has S^ot = -We/6 when 

N e = |A|. 

suggested that their mechanism of ferromagnetism is restricted to special situation with 
infinite U and a single hole. See Section 4 of [8] for a modern proof and further discussions. 

3.3 Lieb's ferrimagnetism and flat-band ferromagnetism 

In 1989, after more than two decades from the works of Nagaoka and Thouless, Lieb proved 
an important theorem for the Hubbard model with N e = |A| (i.e., half-filling) on a bipartite 
lattice [12]. For the Hubbard model with U > on lattices which have two sublattices with 
different numbers of sites, Lieb's theorem implies the existence of ferrimagnetism, a weaker 
version of ferromagnetism. A typical example is the Hubbard model on the so called copper 
oxide lattice of Fig. 1, where the ground states are proved to have Stot = N e /6 when N e = |A|. 
The models exhibiting Lieb's ferrimagnetism have peculiar single-electron band structures 
where the band at the middle of the spectrum is completely flat (or dispersionless). One 
may regard Lieb's ferrimagnetism as a precursor to the flat-band ferromagnetism that we 
shall discuss. 

Flat-band ferromagnetism was discovered first by Mielke [13, 14, 15] and then by Tasaki 
[2, 3]. Mielke treated the Hubbard model on a general line graph, where t r>s = t > for those 
pairs (r, s) corresponding to the edges (or bonds) of the lattice, and t r ^ s = otherwise. The 
models have peculiar band structure where the lowest single-electron band is completely flat. 
Mielke proved that the models with U > exhibit saturated ferromagnetism for suitable 
electron numbers which correspond to the half-filling of the lowest bands. A typical example 
(and the most beautiful example of flat-band ferromagnetism) is the Hubbard model on 
the kagome lattice of Fig.2, which was proved to exhibit saturated ferromagnetism when 
N e = |A|/3. See also [16, 17, 18] for Mielke's results on Hubbard models with partially flat 
bands. 



7 



Figure 2: The kagome lattice is the line graph of the hexagonal lattice. Mielke [13, 
14, 15] showed that the Hubbard model on the kagome lattice exhibits ferromagnetism 
when N e = |A|/3 for any U > 0. 




Figure 3: Tasaki's flat-band Hubbard model in one dimension [2, 3]. The hopping 
amplitude t r>s is u 2 t for the horizontal bonds and vt for the diagonal bonds. The sites 
in the upper and the lower rows have on-site potential t r ^ which equal t and 2v 2 t, 
respectively. When N e = |A|/2, the model exhibits saturated ferromagnetism for any 
t > 0, v > 0, and U > 0. See Theorem 4.1. 

Tasaki [2, 3] proposed his version of Hubbard models with flat lowest bands, and proved 
the existence of saturated ferromagnetism for U > when the lowest bands are half-filled. 
As can be seen from the one-dimensional example in Fig. 3, his models have two different 
kinds of lattice sites which are sometimes interpreted as metallic and oxide atoms, and have 
next nearest neighbor hopping amplitudes. By fine-tuning the hopping amplitudes and the 
on-site potentials, the lowest band becomes flat. See [19] for an extension. 

A common feature of Lieb's ferrimagnetism and Mielke's and Tasaki's ferromagnetism is 
that their models have single-electron bands which are totally flat (i.e., dispersionless), and 
the magnetization is supported by electrons in the flat bands. (For Lieb's ferrimagnetism, 
the latter statement is correct only in a vague sense.) This observation is consistent with 
the Stoner criterion which states that large UD-p favors ferromagnetism. Here the criterion 
is realized by infinitely large density of states Dp. 

The works of Lieb, Mielke, and Tasaki have shown that rich classes of Hubbard models 
on slightly complicated lattices exhibit nontrivial magnetic behavior. Such a view may 
be helpful in understanding insulating ferromagnetism observed in a cuprate [20, 8], and 
has even motivated some scientists to design novel ferromagnetic materials. See [21] and 
references therein. But one should not forget that the Hubbard model is a highly idealized 
model. To find implications of the results for the Hubbard model in realistic many-electron 
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systems defined in continuum space is a formidably difficult but a challenging problem. See, 
for example, [21, 22]. 

3.4 Beyond flat-band ferromagnetism 

Although Lieb's ferrimagnetism and Mielke's and Tasaki's flat-band ferromagnetism cer- 
tainly have shed novel light on the mechanisms of magnetic ordering in interacting electron 
systems, they do not deal with the true "competition" between quantum dynamics and 
Coulomb interactions. When the Coulomb interaction U is vanishing, all of their models 
have highly degenerate ground states. The degeneracy reflects the existence of completely 
flat bands. Among these degenerate ground states for U — 0, there are ferrimagnetic or 
ferromagnetic states as well as states with much smaller magnetization. The role of the 
Coulomb interaction in these models is simply to lift the huge degeneracy and "select" the 
states with highest magnetization as unique ground states. Consequently ferrimagnetism or 
ferromagnetism in these models takes place for any values of U > 0. In other words magnetic 
ordering is stabilized by infinitesimally small Coulomb interaction. This is quite different 
from situations in realistic systems where the interaction must be greater than some positive 
critical value in order to destabilize Pauli paramagnetism and get magnetic ordering. 

It may be needless to say that the existence of completely flat lowest bands (especially in 
Tasaki's models) is unrealistic, or even pathological. The flatness of the bands is destroyed 
by arbitrarily small generic perturbation, and is far from robust. 

It was therefore highly desirable to go beyond flat band models. A natural step was 
to modify the model by adding extra hopping terms to the Hamiltonian thus making the 
flat band dispersive, and then to show that the magnetic ordering survives. One can only 
hope this scenario to work for sufficiently large U since magnetic ordering becomes truly a 
nonperturbative phenomenon when the band is not flat. 

As the first step in this direction, the local stability of the ferromagnetic state was 
proved in models obtained by adding arbitrary small short-range hopping terms to Tasaki's 
version of flat-band Hubbard models [4, 5]. In this work, it was also shown that low-lying 
excitation energy above the ferromagnetic state has the dispersion relation expected for a 
magnon excitation. Then it was proved in [6] that a one-dimensional Hubbard model with 
non-flat bands exhibits saturated ferromagnetism for sufficiently large U. The model was 
obtained by adding extra nearest neighbor hopping terms to Tasaki's one-dimensional flat- 
band Hubbard model as in Fig. 4. This was the first rigorous example of ferromagnetism in 
an electron system without any singularities, i.e., with finite interaction and finite density 
of states. Shen [23] announced a computer assisted extension of the proof in [6] to some 
higher dimensional models. The method in [6] inspired similar rigorous works in different 
classes of Hubbard models [24, 25]. In particular Tanaka and Ueda [26] recently succeeded in 
proving the existence of saturated ferromagnetism in a Hubbard model obtained by adding 
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Figure 4: Tasaki's nearly-flat-band Hubbard model in one dimension [6]. The hopping 
amplitude t ra is — v 2 s and v 2 t for the horizontal bonds in the upper and the lower 
rows, respectively, and v[t + s) for the diagonal bonds. The sites in the upper and the 
lower row have on-site potential t r>r which equal t — 2u 2 s and 2v 2 t — s, respectively. 
The model has two bands which are not flat. When N e = |A|/2, the model exhibits 
saturated ferromagnetism for sufficiently large U/s and t/s for any v > 0. When 
v = l/y/2, for example, the appearance of ferromagnetism is proved for t/s > 1.6 and 
sufficiently large U/s. See Theorem 4.2. 

extra hopping terms to Mielke's flat band Hubbard model on the kagome lattice. For closely 
related heuristic works, see [27, 28] and other references in Section 6.6 of [8]. 

The present work is an extension of that in [6]. We extend the theorem in [6] to general 
models in higher dimensions. As was noted in [6] , a straightforward extension of the method 
in [6] applies to a class of higher dimensional models. Instead of using such a method, we 
here present a much more general and simplified proof which naturally covers a more general 
class of models. 

4 Ferromagnetism in typical d- dimensional nearly-flat- 
band models 

In the present section, we concentrate on a simple class of models denned on decorated 
hypercubic lattices, and precisely describe the results of the present paper and our previous 
works. Although our works cover much more general models, it may be useful for the readers 
to see what has been achieved in the context of simple models. In short, we start from a 
concretely defined non-singular model of itinerant electrons, and prove that its low energy 
properties coincide with what one expects in a "healthy" ferromagnet. 

Let S denote (only in the present section) the ci-dimensional Lx- ■ - xL hypercubic lattice 
with the unit lattice spacing and periodic boundary conditions. We let L > to be an odd 
integer. We take a new site in the middle of each bond (i.e., a pair of neighboring sites) in S, 
and denote by X (again only in this section) the collection of all such sites. We shall study 
the decorated hypercubic lattice A = £ Ul in the present section. See Fig. 5. 

We define a Hubbard model on A which is characterized by four parameters t > 0, s > 0, 
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Figure 5: The lattice structure and the hopping amplitudes in the two dimensional 
model. The black dots are sites in £ , and the gray dots are sites in X. (a) shows the 
flat-band model with s = 0, and (b) shows general model with s > 0. 



v > 0, and U > 0. The hopping amplitude of the model is given by 



v(t + s) 



if \r 



1/2; 



-u 2 s 



IdvH - s 
t - 2v 2 s 
K 



if r, s £ £ and \r — s\ — 1; 
if (r, s) e B; 
if r = s G £; 
if r = s G X; 
otherwise, 



(4-1) 



where we set 



B 



|(r,s) 



r, s G X, |r — s| = l/v2> U 



|(r, s) r, sGX, |r - s| = 1, (r + s)/2ef|. 

(4.2) 

There are nearest neighbor and next nearest neighbor hopping amplitudes. See Fig. 5. This 
rather complicated expression for t rs comes from a simple construction in Section 5.3. See 
(5-12). 

One can easily calculate the single-electron properties corresponding to the above hopping 
amplitudes. There are (d + 1) single-electron bands 5 and their dispersion relations are given 

5 The readers unfamiliar with the notion of bands may ignore this part or refer to Appendix E of [8]. 
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by 6 



( _ s _2z/ 2 S E^i(l + cosA; M ) if j 
Sj(k) = < t if j 

[t + 2vHj: d ^l + cos^) if j 



i; 

2,..-,d; 



(4.3) 



Here A; = (k±, . . . ,kd) is the wave vector in the set 




(4.4) 



In the flat-band model with s = 0, all the bands except the uppermost band with j — d + 1 
are dispersionless (or flat) as in Fig. 6 (a). In a general model with s > 0, the lowest band 
becomes dispersive as in Fig. 6 (b). Since our ferromagnetism is supported by electrons in 
the lowest band, it is crucial that the lowest band becomes dispersive. Reflecting the special 
geometry of the decorated lattice, the intermediate bands with j = 2, . . . , d are always 
dispersionless. This, however, is not crucial to low energy behavior of our model. Indeed 
it is not difficult to add proper extra hopping terms to the model to make all the bands 
dispersive while maintaining the existence of ferromagnetism. See Section 6.2. 

We consider the Hubbard model with the Hamiltonian (2.7), the hopping amplitudes 
(4.1), and the electron number N e — \£\ — L d . 

We first recall the result about the flat-band ferromagnetism proved in [2, 3]. (See Section 
6 of [8] for the most compact proof.) 

Theorem 4.1 (Flat-band ferromagnetism) Let s = 0. Then for arbitrary t > 0, v > 0, 
and U > 0, the above model exhibits saturated ferromagnetism. 

As we have stressed in Section 3.4, ferromagnetism takes place for any positive values of 
U in the flat-band models. When the lowest band is no longer flat, saturated ferromagnetism 
cannot take place for too small values of U > 0. This fact can be seen, for example, from 
the following (easy and well-known) theorem. (See Section 3.3 of [8] for a proof.) 

Theorem 4.2 (Instability of saturated ferromagnetism) Let s > and U < Av 2 s. 
Then the lowest energy among the states with Stot = •S'max — 1 is strictly lower than the 
lowest energy among the states with S tot = S max . This means that the ground state of the 
model has S tot < S nmx , and hence the model does not exhibit saturated ferromagnetism. 

6 In our models, all the bands have simple cosine dispersion relations. This is not the case in general 
multi-band systems, and reflects a special character of our hopping amplitudes. In this sense, our models 
may be regarded as a kind of "idealized tight-binding models." Whether such models are useful in studying 
problems other than ferromagnetism is an open question. 
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Figure 6: The dispersion relation (4.3) of the single-electron bands in the two dimen- 
sional model. The horizontal axes represent —it < k±,k2 < vr, and the vertical axis 
denotes the single-electron energy, (a) shows the flat-band model with s = 0, and (b) 
shows general model with s > 0. 
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The theorem, unfortunately, does not tell us what the ground states look like for small 
U . (We nevertheless believe that the model has ground states with S to t = for sufficiently 
small U .) It assures us, however, that the appearance of saturated ferro magnetism, which is 
established in the following theorem, is a purely nonperturbative phenomenon. 

Theorem 4.3 (Ferromagnetism in nearly-flat-band models) Whent/s, U/s, and\/v 
are sufficiently large (how large these quantities should be depend only on the dimensionality 
d ), the above model exhibits saturated ferromagnetism. 

This is a special case of our main theorem in the present paper, Theorem 5.2. For the 
model with d—1, one can prove the same statement for any values of v > 0. See Section 6.2. 
A computer assisted proof of the above theorem for d = 2, 3 (which makes use of an extension 
of the method in [6]) was announced by Shen [23]. 

Moreover our earlier results in [4, 5] about low-lying excitations also apply to the present 
model. For any x G Z d , define the translation operator T x by 



T 




n 4 $ ™ 



6 A 




(4.5) 



where we use periodic boundary conditions to identify r + x with a site in A (if necessary). 
Then, for any k G /C, we let Es W (k) be the lowest possible energy among the states that 
satisfy 5 t ( ^$ = {(A c /2) - 1}$ and T x [$] = e ikx $ for any x. In other words, E sw (k) is the 
lowest energy among the states where a single spin is flipped (from the ferromagnetic ground 
state) and the total momentum is k. Then we have the following theorem. (For more precise 
statements, see Section 3.3 of [5].) 

Theorem 4.4 (Dispersion relation of low-lying excitations) Let E GS be the ground 
state energy. When t/s, U/s, t/U , and \jv are sufficiently large, one has 

F^z/f/^sin^) 2 > F SW (A:) -F GS > F 2 Au'U ^(sin ^) 2 , (4.6) 

/Lt=l " /1=1 

for any k G K. Moreover the constants F 1 and F 2 tend to 1 as s — > and v — > 0. 
Therefore, for sufficiently small s and v, we have an almost precise estimate 

d , 

F SW (A:) -F GS ^4z/V^(sin^) 2 , (4.7) 
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about the low-lying excitation energies. We note that this dispersion relation is what one 
expects for the elementary magnon excitation in a ferromagnetic Heisenberg model on S 
with the exchange interaction J c g = 2u A U. 

To summarize, we have obtained a class of non-singular models of itinerant electrons 7 
(with only spin-independent interactions) whose low energy behaviors are rigorously proved 
to be that of a "healthy" insulating ferromagnet 8 . By a "healthy" insulator, we mean 
an itinerant electron system whose low energy properties can effectively be described by an 
appropriate quantum spin systems. Although we can hardly claim that our model is realistic, 
the similarity with ferromagnetism observed in a cuprate (see Section 7.1 of [8]) suggests 
that our models share some features with some of the existing ferromagnetic insulators. 

Let us finally discuss whether our ferromagnetism is robust against perturbations. We 
note that Theorem 4.4 about the low-lying excitation is still valid when one adds small 
arbitrary translation invariant perturbation to the hopping amplitudes 9 . In other words, 
local stability of the ferromagnetic state is proved for slightly perturbed models. Since it is 
generally believed that local stability of ferromagnetism implies global stability (see [17] for 
a related rigorous result), this strongly suggests that the global stability of ferromagnetism 
(as is stated in Theorem 4.3) is valid for general perturbed models. 

5 The model and main results 

5.1 Construction of the lattice 

We define our lattice and the Hubbard model on it. 

Let us give a brief explanation first. Our lattice A consists of two kinds of sites called 
external sites and internal sites. The sets of all the external and the internal sites are denoted 
as £ and X, respectively. In the model of Fig. 5, for example, the black dots are external 
sites and gray dots are internal sites. The whole lattice A is decomposed into a union of 
overlapping cells. Each cell contains a single internal site and n (n > 2) external sites. An 
internal site u belongs to exactly one cell (denoted as C u ), while an external site x belongs 
to m (m > 2) cells. In Fig. 5, a bond which consists of two black dots and a gray dot is a 
cell. 

7 It is true that the Hubbard model itself is "singular" when compared with more realistic models in 
continuum. But this is a consequence of the way of describing physical systems, and does not necessarily 
mean that underlying system (if any) is singular. We believe, on the other hand, that the models with 
U = oo or Dp = oo have more manifest singularities. 

8 It should be noted that insulating ferromagnets are rather rare in reality. To prove the existence of 
metallic ferromagnetism, in which a set of electrons contribute both to conduction and magnetism, in certain 
version of the Hubbard model is a challenging open problem [8] . 

9 One must also replace Eq$ in the theorem with the lowest energy of the states with S tot = S max . 
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To be more precise, let us define the general lattice by using the "cell construction" as 
in [8]. This allows us to cover a general class of models in a unified manner. 

We fix two integers n, m > 2 which will characterize our lattice. Let the basic cell be a 
set of (n + 1) sites written as 

C = {u,X!,x 2 , • ■ • ,!„}. (5.1) 

For convenience, we call u the internal site of C, and x±,x 2 , • • • ,x n the external sites. 

To form the lattice A, we assemble M identical copies of the basic cell, and identify 
external sites from m distinct cells regarding them as a single site. We do not make such 
identifications for internal sites. We assume that the lattice A thus constructed is connected. 
Usually A becomes a periodic lattice by this construction. 

The lattice A is naturally decomposed as 

A = XU£, (5.2) 

where X and £ are the sets of internal sites and external sites, respectively. From the 
above construction, we see that the numbers of sites in these sublattices are |X| = M and 
\£\ = nM/m. 

See Figure 7 for some examples of the basic cell and corresponding lattices. The examples 
treated in Section 4 are obtained by considering the cell with n — 2, and setting m = 2d. 

We can easily treat models where n and m are not identical for different cells, but we 
here concentrate on the simplest case with constant n and m. (We still can treat a variety 
of lattices by choosing different n, m, and ways of assembling the cells.) 

For an internal site u G X, we denote by C u C A the cell which contains the site u. For 
an external site x G £, we denote by A x C A the union of m cells which contain the site x. 

5.2 Fermion operators 

We define special fermion operators which will be crucial for our analysis. Let v > be a 
constant. (We note that \jv corresponds to A in our previous publications [2, 3, 4, 5, 8].) 
For x G S, let 

0"£,U = C X,U ~ V ^ ] C M,CT5 (5-3) 

ueA x nx 

where the sum is over m internal sites adjacent to x. Similarly for u G X, let 

b u ,a = C u , a +1S ^ C *,°' ( 5 - 4 ) 

xec u \{u} 

where the sum is over the n external sites adjacent to u. 
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Figure 7: Examples of cells and lattices. The black dots represent external sites, and 
the gray dots represent internal sites. (A) From the cell with three sites (n = 2), one 
can form (a) a one-dimensional lattice (which is drawn as the delta chain or the zigzag 
chain in Figs. 3 and 4) by identifying two external sites (m = 2), or (b) a decorated 
square lattice (which also appears in Fig. 5) by identifying four (m = 4). (B) From the 
cell with five sites (n = 4), one can form, for example, (c) another decorated square 
lattice (which will appear in Fig. 8) by identifying four external sites (m = 4). There 
are many similar examples in higher dimensions. 
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From the anticommutation relations (2.1) for the basic c operators, one can easily verify 
that 

{< CT A,r} = (5.5) 

for any x G £ , u G X, and a,r =|, |. This means that the a operators and the b opera- 
tors correspond to distinct spaces of electrons. We shall discuss more about this point in 
Section 5.4. 

The anticommutation relations between the a operators are 

{1 + mis 2 , if x = y, a = r; 
£ x , y u 2 , ii x ^y,a = T; (5.6) 
0, if a ^ t. 

For x, y G £, we defined 

4 ;J/ = lA^nA^n 1\, (5.7) 

which is the number of distinct cells which contain both x and y. For the b operators, we 
similarly have 

{1 + nv 2 , if u — v, a — r; 
4,„z/ 2 , ifu^v, <T = T] (5.8) 
0, if o- ^ r. 

For u, v G X, we defined 

^ = |C u nc„n^|, (5.9) 

which is the number of external sites which are adjacent to both u and v. One sees that a 
operators or b operators simply anticommute with each other if the reference sites are suffi- 
ciently separated. The slightly complicated anticommutation relations (found for sufficiently 
close reference sites) reflect the use of basis states which are localized but not orthogonal 
with each other. 

An important property of the a and b operators is that one can represent arbitrary states 
of the system by using these operators. The key is the following lemma. 

Lemma 5.1 For any r G A and a =], {, one has 

°r,c = ^ lx a X ,a + y^ J Vu K,a, (5.10) 
xd£ mGX 

with suitable coefficients ^ x and r\ u . 



Proof: Consider a Hilbert space which consists of operators of the form J2 r cA ar Cr >°" w ith 
a r G C. We fix a to be either | or J,. The inner product of the two "vectors" J2 r &A ar ° r ,° anc ^ 
J2reA & c r ,a is defined to be the anticommutator {{J2 re A a r c r,aY , J2 r &A Pr c r ,a} = J2 r -eA "7 A- 
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Since {a xa , b u ^ a } = for any x G £ and any uel, the subspace spanned by the set {a X)(7 } xe £ 
and that spanned by the set {b UtCT } ue j are orthogonal. Since a x>a with different x are linearly 
independent, the dimension of the former subspace is equal to \£\. Similarly the dimension 
of the latter subspace is |X|. Noting that \£ \ + |X| = |A| is the dimension of the whole space, 
one finds that the set {a Xi(T } xe £ U {b u<a } ue x spans the whole space. This means that any c rjCr 
can be expanded in terms of a XjlJ and b Uj(7 as in (5.10). ■ 

Recall that the basis states of the many-electron Hilbert space are (2.4). As a consequence 
of the lemma, we find that an arbitrary many-electron state of the system can be represented 
as a linear combination of the basis states 



*M(£ t>J E;, j t , j A ) = n <t n <i n <t n <i (s.n) 




with arbitrary subsets E^E^ C £ and C X. Here \E^\ + \ E^\ + + = N e is the 
total electron number. 



5.3 Definition of the model and the main theorem 

Our model is characterized by the four parameters t > 0, s > 0, U > 0, and v > 0. The 
Hamiltonian of the model on A is 

H = -sJ2 a l,a a x,a + t b l* + U Y1 n ^ n r,h ( 5 - 12 ) 
xe£ uei reA 

<r=T,i <r=T,i 

where the number operator n X|ff is defined in (2.3). 

Recalling the definitions (5.3) and (5.4), one sees that this defines a Hubbard model with 
nearest and next nearest neighbor hopping terms. We can rewrite (5.12) in the standard 
form (2.7) with the hopping matrix given by 

t x ,x = rntv 2 — s, x & £ 

t uu — t — nsu 2 , u G X 

{t + 8h lie, <» 3 » 

tx,y — ^x,y v t, x, y G £ , x ^ y 

k *u,t; = ^ 2 S, U,V <El, U ^ V 

Note that the model has nearest and next-nearest neighbor hopping amplitudes, but not 
more. See Figs. 4, 5 (b), and 8 for examples 10 . 

10 Observe that the lattice in Fig. 5 (b) can be obtained by either setting n = 2, m = 4, or n = 4, m = 2. 
(In the latter case, the black dots correspond to the internal sites.) This means that we have models which 
exhibit saturated ferromagnetism at different electron numbers in different regions in the parameter space. 



tx,u t u x 
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Figure 8: Another example on the fee like lattice in two dimensions obtained by setting 
n = m = 4. 



We consider the Hilbert space with the electron number fixed to N e = \£\ = nM/m. 
Note that this electron number is consistent with the interpretation that an external site 
represents a metallic atom which emits one electron to the system. 

Exactly as in Theorem 4.1, it can be shown that the flat-band models with s = exhibit 
saturated ferro magnetism for any t > 0, v > and U > 0. See Section 6 of [8] for a proof. 
The instability of saturated ferromagnetism for sufficiently small U as in Theorem 4.2 can 
be of course proved for the general models. See Section 3.3 of [8]. 

Our main result is the following theorem which shows that the ground states of the model 
exhibit saturated ferromagnetism 11 . 

Theorem 5.2 When t/s, U/s and \jv are sufficiently large (how large these quantities 
should be depends only on the local structure of the lattice, but not on the size of the lattice ), 
the ground state of the model is (N c + l)-fold degenerate and has the total spin S tot = N e /2. 

In the proof of the theorem, we further show that one of the ground states is written as 




(5.14) 



and other ground states are obtained by applying the spin lowering operator S, 
onto the state (5.14). 



'tot — 



11 



Theorem 4.4 about the low-lying excitation is valid in a wide range of models. See [5]. 
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5.4 "Band" structure in the single-electron problem 

Before proceeding to prove the theorem, we discuss a basic property of the single electron 
problem associated with the present model. Although the discussion is not necessary for the 
proof of the main theorem, it sheds light on a special character of the model that we are 
studying. 

The single electron Hilbert space f) is the | A (-dimensional linear space spanned by c\ |$ vac 
with r G A. (We here consider the space of up-spin electrons, but this choice is not essential.) 
This space is decomposed as 

f) = J)L©f)u, (5.15) 

where h L is spanned by a) x |$ vac with x E £, and f)u by b\ |$ V ac with u EX. Then we have 
the following. 

Proposition 5.3 The Hamiltonian H can be diagonalized within f) L and within f) U; respec- 
tively. The energy eigenvalues e in f) L satisfy 

-s{l + (m + £i> 2 } < e < min{0, -s{l + (m - 4> 2 }}, (5.16) 

where £l = J2 y es y ^ x ^x,v x e £> an( ^ ^ e ener 9U eigenvalues e in f)u satisfy 

max{0, t{l + {n- £v)is 2 }} < e < t{l + (n + £u)v 2 }, (5.17) 

where £ v = J2vez,v^u ^u,v with u El. 

The proposition states that the spectrum of the Hamiltonian H in the single electron 
Hilbert space I) consists of two distinct "bands." When v is sufficiently small (which is the 
case when the theorem holds) , the two "bands" do not overlap and are separated by a finite 
gap. The fermion operator a\^ a creates an electron in the lower "band" , and b\ c creates an 
electron in the upper "band." 

When the model has a translation invariance as in the models of Section 4, the single 
electron Hilbert space is decomposed into several bands in the standard sense. The lower 
or upper "band" that we mentioned above is not necessarily a band in the usual sense, but 
may be a union of several bands. In the band structure (4.3) discussed in Section 4, the 
lowest band with j = 1 corresponds to the lower "band" , and the collection of the remaining 
d bands corresponds to the upper "band." 

Proof: The proof is elementary but requires some care. 
Consider a state <3> of the form 

$ = ^Or^vac, (5.18) 
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where (p(x) are complex coefficients. From the anticommutation relations (5.6), one finds 
that 



H$ = -^s\(l + mv 2 )^{x) + v 2 J2 e *Mv) \ 4, T ^ 



(5.19) 



x&S 



yes 

y^x 



Since the right-hand side is again a linear combination of al^ vajC , we find that H can be 
diagonalized within f) L - We now assume if<3> = e$. By comparing the coefficients in (5.18) 
and (5.19), we find 



-s(l + mv 2 )ip(x) - sis 2 ^2£ Xty (p(y) = e<p(x). 



(5.20) 



ye£ 

y+x 



By multiplying (5.20) with (p(x), by summing it over x G £, and by solving it for e, one gets 

/ __ \ 

s(l + mv 2 ) \v{x)\ 2 + sv 2 ^ £ x, y <p(x) (p(y) 



e = 



V 



x,y&€ 
x^y 



(5.21) 



/ 



By using the inequalities 



-(|v9(x)| 2 + |v9(y)| 2 ) < <p(x) <p{y) + <p(y) < \ V (x)\ 2 + |^(2/)| 2 , 
which follow immediately from \tp(x) ± tp{y)\ 2 > 0, we find from (5.21) that 
-s(l + mv 2 ) - sv 2 £ L < e < -s(l + mv 2 ) + sz/ 2 4. 



(5.22) 



(5.23) 



This, with the positive semidefiniteness of aJ, CT a Zi(J , proves the desired (5.16). The other 
inequality (5.17) is proved in exactly the same manner using the b u ^ operators. ■ 



6 Proof 

6.1 Proof of the main theorem 

The basic strategy of the proof is first to show the appearance of ferromagnetism in a local 
piece of the system, and then to "connect" these local ferromagnetism together to get the 
desired ferromagnetic state on the whole system. Of course such a "connection" usually 
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does not work in itinerant electron systems where electrons behave as "waves." Our method 
makes a full use of special features of our model as well as of ferromagnetic states. 
Our proof is based on the decomposition of the Hamiltonian 

where h x acts only on the sublattice A x . The local Hamiltonian h x is defined as 

h x = sJ2 a i,* a x,* + l ~ h u,oK° + ^ n v>l n v>l + \ Yl n ^n u ,i, (6-2) 

<r=T,i u£A x m y&A x ns ueA x m 

where n' = \A X C\£\. It should be stressed that h x with neighboring x do not commute with 
each other. One therefore cannot diagonalize all h x simultaneously. 

As for the lowest eigenvalue and the corresponding eigenstates, however, we have the 
following. This lemma plays a key role in our proof of the theorem. 

Lemma 6.1 When t/s, U/s and \ jv are sufficiently large, the lowest eigenvalue of h x is 
— s(l + mv 2 ), and any corresponding eigenstate $ can always be written as 

$ = < T $i + 4 a $ 2 , (6.3) 

where $i ; $2 (ire suitable states with N c — 1 electrons. The eigenstate $ also satisfies 

c riT c ra $ = 0, (6.4) 

for any r G A x . 

We shall prove Theorem 5.2 assuming Lemma 6.1. The lemma will be proved in Sec- 
tion 6.3. 

Since h x > —s(l +mis 2 ), we have H = ^ xe£ h x > — s(l + mv 2 )\£\. A straightforward 
calculation using the anticommutation relations (5.5) and (5.6) shows that the state (5.14) 
is an eigenstate of H with the eigenvalue — s(l + mv 2 )\£\. Therefore we see that the state 
(5.14) is a ground state. Our goal here is to characterize all the ground states. 

Let $ be an arbitrary eigenstate of H with the eigenvalue — s{l + mv 2 )\£\. Then it follows 
from h x > —s(l + mu 2 ) that 

h x $ = -s(l + mis 2 )®, (6.5) 

for any x G £. Thus $ satisfies the properties stated in Lemma 6.1. 

Let us expand $ in the basis states ^q\e^, Jj, 7|) of (5.11). Since $ satisfies (6.3) 
for any x G £, it follows that only those basis states with U E± = £ contribute. Since the 
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electron number is \E^\ + \Ei\ + |7|| + |/J = N c = \£\, the condition U E± = £ implies 
£j- fl Ei — and /j = I± = 0. Therefore the expansion of <3> in the basis states (5.11) can be 
rearranged into a "spin system representation" as 

$ = E^)(ri a U)) <|) vac, (6.6) 

where cr = (a(x)) xe £ is summed over all the spin configurations with a(x) =|, |, and ip(cr) 
are complex coefficients. 

We then examine the property (6.4) for u G X. From the definition (5.3), we find that 
for any u G X 

c «,t c «,i n = e sgn ^' ^ =Tj ^ = ^ n $ ™ c ' 

Wf / y,zeC u \{u} \xee\{y,z} J 

y+z 

(6.7) 

where the sign factor sgn(y, z) comes from the anticommutation relations, and satisfies 
sgn(y, z) = —sgn(z,y). The characteristic function %[•] is defined as usual by x[t rue ] — 1 
and x [false] = 0. By using (6.6) and (6.7), we find for any u G X that 

c uA c ud $= Yl J2 s My,z)xHy) =h°(z) =l]{4'(<7)-4'(<7y^z)} [ JJ a£ >ff(x) $ va c, 

y,zeC u \{u} o- \:re£\{s/,z} / 

(6.8) 

where we have introduced an arbitrary ordering in £ to avoid double counting. The spin 
configuration <T y ^± z is obtained from cr = (a(x)) xe s by switching <r(y) and cr(z). Since the 
basis states in the sum (6.8) are all linearly independent, we find from the property (6.4) 
that 

^)=^M. (6-9) 

for any y,z G £ for which there is u G X such that y,z E C u . Since the whole lattice is 
connected, (6.9) implies that all i/j(<t) with the same M = J2 x e£ a ( x ) are identical. This 
completes the characterization of the space of the ground states. The ground state which 
has a fixed total spin in the z-direction is 

^= e (n Cm)*-, (6.io) 

cr \xe£ ) 

E«^(x)=Af 
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where M = -(|£|/2), 1 - (|£|/2), , • • • , (\S\/2) - 1, \£\/2. Thus the ground states are \S\ + 1 
fold degenerate. It is easy to check that 

(S tot ) 2 $Af = S max (S max + 1)^m, 5S$ M = M$ M , (6.11) 
with ^max = |£|/2 being the maximum possible value of the total spin of iV e = |£| electrons. 

6.2 Some extensions 

Let us make two brief remarks about extensions of Theorem 5.2. 

The first extension deals with the one dimensional model of Fig. 4, which (in the notation 
of Section 5.1) has n = m = 2. In this model, Tanaka [29] observed that the statement of 
Theorem 5.2 can be proved if one first fixes arbitrary v > and then takes sufficiently large 
t/s and U/s. 

To show this extended theorem, one proves the statement corresponding to Lemma 6.1 
by the method we used in [6] to prove the similar lemma. The differences between the lemma 
in [6] and that in the present paper comes from a difference in the definitions of the local 
Hamiltonian. Unlike the definition (6.2) in the present paper, we did not include the on-site 
repulsion terms from the external sites other than x in the local Hamiltonian used in [6]. 
This seemingly minor difference indeed makes a considerable difference in the conditions 
that we obtain in the limit U — > oo. The same method as in [6] thus yields much stronger 
information for the local Hamiltonian defined as in the present paper 12 . We leave the details 
to the interested readers. 

The second extension is much more straightforward and less important. For arbitrary 
complex coefficients /„, define 

b=J2 (6,12) 

which is obviously positive semidefinite. From the expression (6.11) for the ground states 
and the anticommutation relations (5.5), one readily finds that 5$ = for any ground state 
$. 

This means that one may add to the Hamiltonian the new hopping terms 

ff' = EE {Etfwj'jEtfw}, (6.i3) 

12 After the publication of [6], Kubo [30] and Shen [31, 23] independently noticed the importance to include 
the on-site repulsions from the external sites in the local Hamiltonian. 
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with arbitrary /„ ' without modifying the ferromagnetic ground states. In this manner, one 
can modify, for example, the models in Section 4 so that all the bands become dispersive 
maintaining the appearance (and the provability) of saturated ferromagnetism. 



6.3 Proof of Lemma 6.1 

It suffices to prove the lemma for h Q with a fixed o G £. Since h Q acts only on A Q , we only 
consider an electron system defined on A Q without specifying the electron number. We also 
write £ Q = A D n £ and T Q = A Q n X. The local Hamiltonian that we consider is 

h = sJ2 a laa>o,o + ^~Y1 h n,oK° + 3 Yl n ^^x,i + ^;Y1 n ^ n ud- (6-14) 

it it it 

<T=t,4. u£lo XG£o U&Xo 

We stress that the statement of the Lemma is about the property of a finite dimensional 
matrix h . It is thus possible (in principle) to prove the lemma for fixed n, m by using a 
computer 13 . But the problem is indeed rather delicate, and the proof for general cases seems 
highly nontrivial. 

As we have restricted our lattice, we redefine (only in this proof) the operator a XtCr for 
x G £ \{o} as 

a x ,a = c x>a -v ^ c u,v (6-15) 

The definition of b UjC is unchanged. Note that we still have {a xa ,b U!T } = 0. 

Exactly as in (5.11), any state defined on A c can be written as a linear combination of 
the basis states 

^\E h E l j h i l )=(i[aiA ( n<i) (rKt) (n 6 u*™> ( 6 - ie ) 

J \xeE l J y«e/ T J \uei l J 

with arbitrary subsets E^E^ C £ Q and C X G . Here we do not fix the electron number 
which is given by \E^\ + \E±\ + |Jf| + 



6.3.1 The limit *, U -> oo 

Let us first consider the limit where t — > oo and U — > oo. It is easily found that the 
lowest eigenvalue of h Q is finite in this limit. (Try, for example, the state a\^ va _ c .) Note 
that the parts in h which contain t are (t/n) ^2 ue j o X1<t=t j. ^u,a^u,a, and which contain U 

13 The numerical values of t/s in the caption to Fig. 4 was obtained by using a computer. See also Fig. 2 
of [6]. Shen [23] has done this for some models in higher dimensions. 
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are (U/n') YL x e£ n x,i n x,i + (U/n) J2 ueXo n u ^n u ^. Since each term in these sums is positive 
semidefinite, the necessary and sufficient condition for a state $ to have a finite energy in 
the limit t — > oo, U — > oo is 

= 0, (6.17) 

for any u G X D and a =|, j, and 

c riT c ra $ = (6.18) 

for any r G A Q . To get (6.18), we noted that n r ^n r> i = (c T ^c r ^c r ^c T ^. 

To see implications of the condition (6.17), we introduce dual operators b ua for u G X D 
and a - =|, | which satisfy 

{6u,<r,&J )T } = <J ff)T , (6.19) 

for any -u,t> G X Q and cr, r =|, |. More precisely the construction is as follows. Define the 
Gramm matrix G by (G) U; „ = {6^,6;^}, where u, v run over X . The linear independence 
of the basis states implies that G is invertible. For u G X G and a =f, |, define 

V» = XI ( G_1 )«^ ( 6 - 20 ) 

where it is easy to check (6.19). 

From (6.19) and (6.16), one sees that 

blX^\E h E b I hh ) = lf\E h E l ,I h I l ), if « G (6 . 21) 

^ U, otherwise, 

for any u G X D and cr =T> !■ Let ^ be an arbitrary state satisfying (6.17). Then since b UjC is 
a linear combination of b w ^, one has 6 Ui0 -$ = and hence 

b[Ju,^ = 0, (6.22) 

for any u G X„ and a =|, |. Then from (6.21) and the linear independence of the basis states 
(6.16), one finds that the state $, when expanded in the basis states cannot include 
Ihh) witn nonempty J T or I t . 
Therefore we conclude that $ is a linear combination of the basis states 



^(E^E,) = ^\E h E b ^) = ( II <T 1 ( II <i 1 (6.23) 
with arbitrary E^, E± C £ Q . 




<x€Ef I \xeE< 
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We then examine the condition (6.18) for r G S Q . Noting the definitions (5.3), (6.15) of 
a x ^ and (6.23), we see that 

for any x G £ c . Now the condition (6.18) for r G £ Q implies 

a l,l a U c ^ c *,l® = 0, ( 6 - 25 ) 

for any x G £ Q . Then, as before, we see that $ is a linear combination of ^^(E^, E^) with 
E h E i C £ G such that £ T n = 0. 

For the state $ to have a finite energy in the limits t,U — > oo, it must further satisfy 

c u , T Ca$ = (6.26) 

for any if G X D . This condition is not as straightforward to treat as the previous two 
conditions. 

To see implications of (6.26), we first note that when D E± — 0, 

CuAC^^iE^E^ = v 2 J2 X[x eE h ye £j sgn(x, y; E h E{) ¥ 2 u \E^\{x}, E^y}), 

x,y€C u \{u} 

(6.27) 

where we used the definitions (5.3), (6.15) of a XyCr , and (6.23) of ^^(E^, E±). Here x['\ is the 
characteristic function as before, and sgn(x, y; E-^, E±) = ±1 is the sign factor coming from 
anticommutation relations. 

Let us then expand the state $ as 

$= £ ^(E T ,E i )^ ) (E T ,E i ). (6.28) 

_B T n£|=0 

The zero energy condition (6.26) for any u G T Q implies certain relations that the coefficients 
(p(E^Ei) must satisfy. Noting that the parameter v appears in (6.27) only as a prefactor, 
one finds that these relations for ip(E^, E±) depend only on the lattice structure and do not 
depend on v at all. Although the precise forms of the relations are not needed here, let 
us write them down for completeness. The conditions that the coefficients (p(E^,E±) must 
satisfy are 

x[x^E' v y^E{)sgn(x,y;E\U{x},E{U{y})ip(E\U{x},E{U{y}) = 0, (6.29) 

x,yec u \{u} 

x+y 
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for any E'pE{ C £ Q such that Ei fl E{ — 0, and for any u G X . 

For v > 0, we let be the space of all $ which are expanded as (6.28) with the 
coefficients (p(E^Ei) satisfying the conditions (6.29). For v > 0, the space Ti^ is precisely 
the space of all $ which have finite energy (expectation value) in the limit t, U — > oo. The 
space Ti.^ has no such interpretation, but it is convenient to define this space. Note that 
7~4h^ depends continuously on v > since the range of allowed coefficients ip(EpE^) is 
independent of v and the basis states ^^(EpE^) are continuous in v. 

We also let be the orthogonal projection onto the space Ti^- Again vj£ is contin- 
uous in v > 0. 

For v > 0, to study finite energy states of the local Hamiltonian h Q in the limit t, U — > oo 
is equivalent to study the effective Hamiltonian 

h ( £=V&Wv£, (6.30) 

where 

^ = ~ S tt o,a a o,a- (6.31) 

Again we extend the range of v and define by (6.30) with z/ = 0. Since is continuous 
in v > 0, the effective Hamiltonian is also continuous in v > 0. 

It follows from the standard argument that the eigenvalues of the local Hamiltonian h Q 
with given v > are classified into two sets. In the limit t, U — > oo, the eigenvalues in the 
first set diverge, while those in the second set converge to the eigenvalues of including 
the degeneracies. 

Our next task is to investigate the eigenvalues of h^g . But this is still a nontrivial problem 
since vi u J and do not commute. 



6.3.2 The case v = 

Let us set v = and study . Although is not really an effective Hamiltonian, we get 
crucial information about by studying hf^. 

For v = 0, the operator a x>a is nothing but the basic fermion operator c Xj<7 , and the part 
(6.31) of the local Hamiltonian becomes 

h^ = -sn 0) (6.32) 

where n Q = n ^ + n ^ is the number operator. The problem becomes that of electrons 
strictly localized at sites in S Q , except for the projection operator V^. The existence of the 
projection makes the problem nontrivial. 
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Let us decompose the space Tifl as 



Hfl=S^®V^®M {Q) . (6.33) 

Here consists of all $ G Tifl which satisfy n Q $ = $. In other words, is a set of 
states in H.^ with singly occupied o. Any $ G is written as a linear combination of 
^2 ] ( E h E l) with E^UE t 3 o. Similarly V (0) consists of all $ G Tifl which satisfy n c $ = 0. 
It is a set of states in H.^ with vacant o. Any $ G is written as a linear combination of 
#5, 0) (£ T , £j with E^UE^ o. The space Ai^ is defined as the orthogonal complement. 

Note that and are never empty since cl a <& vac G and cJ, )(T $ vac G where 
a; G £ \{o}. On the other hand, is empty in models with n = 2. Since the following 

argument becomes almost trivial when is empty, we shall assume that is not 

empty. 

Any & E is uniquely decomposed as 

$ = 4 >T $i + 4 a $ 2 + $3, (6.34) 
where (« = 1, 2, 3) satisfy n $; = 0, i.e., o is vacant in these states. We then define 



a 



sup 



(6.35) 



and note that 



To see this, suppose that a 



Then a — 1 implies 



a < 1. (6.36) 

1. Since Ai^ is closed there is $ G Ai^ which attains a — 1. 

which means $3 = 0. But 



,t 



this means $ G <S^, which contradicts with (6.33). 
Now from (6.32), one has 



-s$, 
0, 



if $ G 

if $ G V (0) . 



(6.37) 



Thus $ in or V^ ^ is an eigenstate of . The remaining eigenstates are within the 
space M^. 



As for $ G .M (0) , one has 



^(0) p (0) $ 

"* ^?«o(< T $l + <^ 2 + $3) 



(6.38) 
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Therefore 



$,/$*) = - a ($,^« T $ 1 + c t cj> 2 )) 



(°) f „t 



> 



l$l 



and we get 



(<M2a) 



1$ 



|2 — 



> -s 



l$l 



> — as > — s, 



(6.39) 



(6.40) 



where we used (6.35) and (6.36). By the variational principle, we see that the eigenvalues of 



hfj within the space Ai^ are not less than — 



as. 



Thus we found that the lowest eigenvalue of is — s and its degeneracy is equal to the 
dimension 7 of the space Note that 7 is not vanishing since is not empty. There 
is a finite gap above the lowest eigenvalue. 



6.3.3 Non-limiting cases 

By using the properties of and the continuity in v > 0, one finds that h^g has 7 low 
lying eigenvalues which are separated from larger eigenvalues by a finite gap, provided that 
v > is sufficiently small. By recalling the remark at the end of section 6.3.1, one finds that 
for sufficiently small v > and sufficiently large t and U, the local Hamiltonian h has 7 low 
lying eigenvalues which are separated from larger eigenvalues by a finite gap. 

In what follows, we shall explicitly find these low lying eigenvalues (all of which will turn 
out to be equal to — s(l + mu 2 )), and characterize all the corresponding eigenstates. 

For v > 0, we define as the space of all $ G H^l which are written as linear 
combinations of ^^(E^, E^) such that E^ D E^ 3 o. is not empty since al a ^ vac G «S^. 

In other words, is a set of all $ which are expanded as (6.28) with the coefficients 
(f(E^Ei) satisfying the conditions (6.29) and an additional condition that (p(E^E{) = 
unless E^ U E± 3 o. Again we see that the set of allowed coefficients {<f(E^, E±)} is 
independent of v. Since the basis states ^^\E^,E^) are mutually linear independent for 
each fixed v > 0, we find that for different v > are all identical as linear spaces. In 
particular for any v > has the same dimension as the space i.e., 7. 

Note that any $ G can be written uniquely in the form 

$ = 0^1 + 0^2, (6.41) 

where $1 and $ 2 are linear combinations of ^\E^, E{) with £j fl E± = and £f U E± $ o. 

We will show that, for arbitrary t, U, and v, any $ G is an eigenstate of the local 
Hamiltonian h Q with eigenvalue — s(l + mz/ 2 ). Note that this eigenvalue converges to — s 
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as v — > 0, and is 7-fold degenerate. These facts imply that we have precisely located the 
7 low lying eigenvalues of h for sufficiently large t, U and sufficiently small v. These low 
lying eigenvalues turned out to be completely degenerate, and forming the lowest eigenvalue. 
Since $ G has all the properties declared in the lemma, this leads us to the lemma. 

Let $ be an arbitrary state in S^ U K It only remains to prove that /i $ = — s(l + mz/ 2 )$. 
By construction we have &t ;(7 & U)0 .$ = for any iiGl„ and a =], j, and n r ^n r ^ = for any 
r G A G . Thus we only need to show that 

J2 < a a 0iCT $ = (l+mz/ 2 )$. (6.42) 

<r=U 

From the expression (6.41) and {al a , a 0>cr } = 1 + mu 2 , one has 

^ < CT a 0iCT $ = (1 + mz/ 2 )$ + (6.43) 

where 

$ ' = 4,T I S a U a »." ) $1 + fl o,| ( ^ a l^o,a I $2- (6.44) 

\o-=T,J. / Vo-=T,l / 

On the other hand, from the expression (6.41) and the zero energy condition (6.26), one 

has 

c„,tc Uii $ = c uA c ud al^ 1 + c uA c ud ai d <$>2 

= 0, (6.45) 
where we used (5.3). By operating a) o ^ from the left, the final two lines yield the relation 

<T a li( c «.i $ i- c «,T $ 2)=0, (6.46) 

for any u G X . 

By recalling that a 0jCr = c 0jCr — vJ2 u ei Cu < a anc ^ n °ti n g that c 0)CT $i = c 0i0 -$2 = 0, we can 
rewrite (6.44) as 

= 0, (6.47) 
where we used (6.46). Recalling (6.43), this completes the proof of the lemma. 
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